1/5 



100 



collect and store in a database 
historical premium and loss data 
for plurality of policyholders 
representing a plurality of 
insurance companies' data 
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identify external data sources, 

each external data source having 
a plurality of individual variables 
and each individual variable 
having at least two values 
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normalize the historical premium 
and loss data using well known 
actuarial and statistical 
transformations 



calculate a loss ratio for each 
policyholder using the normalized 
historical premium and loss data 
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Fig. 1B 



calculate a cumulative loss ratio 
for a defined group of 
policyholders 



perform a statistical analysis to 
identify significant statistical 
relationships between the 
cumulative loss ratio for a defined 
group of policyholders and 
external variables 



use experienced actuaries and 
statisticians ("reviewers") to 
identify and select individual 
external variables to be used for 
generating the predictive statistical 
model 



have reviewers compare individual 
external variables and eliminate 
(as necessary) one variable of a 
pair of highly cross-correlated 
individual external variables 
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group the Consortium data base 

into three data sets, namely, 
training data set, test data set and 
validation data set for use in 
constructing the predictive 
statistical model 



use the training data sets to 
construct the predictive statistical 
model 



at certain points during the 
development of the predictive 
statistical model in step 204A, use 
the test data set to refine and 
perfect the predictive statistical 
model 



use validation data set to validate 
the predictive statistical model by 
calculating a score for each record 
in the validation data set 
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Fig. 2B 
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From Fig. 2A 



attach the score to each record in 
the validation data set and sort by 
score in plurality of bins of equal 
size (for example, ten regions of 
equal size, i.e., deciles) 
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calculate the cumulative loss ratio 
of each decile using the work data 
calculated in step 108 
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calculate the loss ratio relativity of 
each decile (i.e., the cumulative 
loss ratio of the decile minus the 
cumulative loss ratio of the entire 
validation data set) divided by the 
cumulative loss ratio of the entire 
validation data set 
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use predictive statistical model to 
score new policyholders and 
determine future profitability as 
such score relates to the loss ratio 
relativity of the validation data set 
which provides an estimation of 
the future profitability 
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Fig. 3 



